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1 Introduction 

Survival analysis has been the focus of many contributions to Bayesian nonparamet- 
ric theory and practice. Indeed, many statistical problems arising in the framework 
of survival analysis require function estimation and, hence, they are ideally suited 
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for a nonpar ametric treatment. Essentially, two closely related lines of research 
have been pursued: the first is represented by the introduction of models for the 
random cumulative distribution function whereas the second deals with models for 
the random hazard rate and the random cumulative hazard. As for the former most 
proposals fall within the class of neutral to the right processes due to Doksum (1974): 
among others, we mention Ferguson (1974), Ferguson and Phadia (1979), Walker 
and Muliere (1997), Walker and Damien (1998), Epifani, Lijoi and Priinster (2003), 
James (2006). As for the latter, one can distinguish models leading to a cumula- 
tive hazard which is almost surely discrete and models for which it is almost surely 
absolutely continuous. The famous beta process derived in Hjort (1990) belongs to 
the first class along the contributions of, e.g., Kalbfleisch (1978), Kim (1999), Kim 
and Lee (2003), De Blasi and Hjort (2006). The second class focuses on the hazard 
rate which is modeled as a mixture and has recently received much attention due 
to a relatively simple implementation in applications. After the seminal papers of 
Dykstra and Laud (1981) and Lo and Weng (1989), important developments deal- 
ing also with more general multiplicative intensity models can be found in. Laud, 
Smith and Damien (1996), Ibrahim, Chen and Mac Eachern (1999), James (2003), 
Ishwaran and James (2004), Nieto-Barajas and Walker (2004, 2005), James (2005), 
Ho (2006), among others. Passing from a hazard rate function to the corresponding 
model for the cumulative distribution function is straightforward if the hazard rate 
is almost surely absolutely continuous, but quite subtle otherwise. See Hjort (1990) 
and James (2006), who establishes a nice link via the notion of spatial neutral to 
the right process. It is also worth noting that all models share a common feature, 
namely, that their basic building block is represented by an increasing additive pro- 
cess (see Sato, 1999) or more generally by a completely random measure, a notion 
introduced in Kingman (1967). 

Let us focus attention on hazard rates that are modeled as mixtures. Denote 
hy U sl positive absolutely continuous random variable representing the lifetime and 
assume that its random hazard rate is of the form 

h{t) = / k{t,x)fl{dx), (1) 

where A; is a kernel and jl a completely random measure on some space X. The cu- 
mulative hazard is then given by H{t) = Jq h{s)ds. Note that, given jl, h represents 
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the hazard rate of U, that is 



h{t) dt = F{t<U <t + dt\U > t,Jj). 



From ([T]), provided H[t) oo for t ^ oo almost surely, one can define a random 
density function / as 

fit) = h{t) eM-H{t)) 

where S{t) := exp{—H{t)) is the so-called survival function providing the probability 
that U > t. Consequently the random cumulative distribution function of U is of the 
form F{t) = 1 — exp{—H{t)). Such models, often referred to as life-testing models, 
have been considered in Dykstra and Laud (1981) and Lo and Weng (1989) with jl 
being an extended gamma process, also known as weighted gamma process. Nieto- 
Barajas and Walker (2004), instead, used a weighted version of a gamma compound 
Poisson process. Analysis beyond gamma-like choices of fi was not possible due 
to the lack of a suitable and implementable posterior characterization: however, 
in James (2005) this goal has been achieved and many choices for jl can now be 
explored. See also Ho (2006) for a posterior characterization via S-paths. 

In this paper, we provide asymptotic results for random hazard rates constructed 
via a mixture approach as in ([T|). In particular, for i = 1,2, 3, we will be interested 
in establishing the existence of two positive functions Tj (T) and iji (T) such that the 
following Central Limit Theorems (CLTs in the sequel) take place as T ^ +oo: 

law 



Vi{T)x H{T)-n{T) 



m (T) X 



- / HtYdt - T2 (T) 


■T 



law 



X2 (a2) 







h{t)-H{T)/T dt-n{T) 



law 



^3 (tT3) 



(2) 
(3) 

(4) 



where, for i = 1,2,3, Xi (uj) is a centered Gaussian random variable, with variance 
(Tj depending on the analytic structures of /i and k. For a fixed T > 0, the random 
objects T^^ Jq h{t)'^dt and T^^ h{t) — H{T)/T dt are called, respectively, the 
(realized) path-second moment and the (realized) path-variance associated with h. 
As we will point out in the subsequent sections, weak convergence results such as 
©J dSD and ([1]) give a description of the overall variability of the hazard rate h[t), by 
providing a synthetic answer to the following crucial questions: (i) "How fast does 
the cumulative hazard rate diverge from its long-term trend ri (T)?", (ii) "How fast 



3 



increases the magnitude of the fluctuations of h{t) above zero?", and (iii) "How big 
are the oscillations of h{t) around its average value?". To the authors knowledge, 
this represents a completely new line of research. Indeed, by now, many results have 
been obtained in terms of consistency of posterior distributions. See Ghosh and 
Ramamoorthi (2003) for an exhaustive account. However, little is known about the 
distributional behavior of the prior ingredients of a Bayesian nonparametric model 
such as ([T|), in particular with reference to functionals of statistical relevance. In 
the more conventional setup of random probability measures, instead of the one 
concerning hazard rates considered here, the first results on linear functionals of the 
Dirichlet process were achieved in the pioneering paper of Cifarelli and Regazzini 
(1990), whereas the variance functional is studied in Cifarelli and Melilli (2000) and 
Regazzini, Guglielmi and Di Nunno (2002). One may try to adopt the approach of 
Regazzini, Lijoi and Priinster (2003) based on Gurland's inversion formula to derive 
expressions for the distribution of linear functionals of general random hazards as 
in ([T|), but to tackle quadratic functionals seems impossible to date. In light of 
these considerations, it seems important to remark that, despite the theoretical 
relevance of our asymptotic results, they also turn out to be helpful in terms of prior 
specification: on one hand they can serve as a guide for deciding on which particular 
completely random measure jl basing the model ([1]) and on the other hand, once 
/2 is chosen, provide hints for selecting the parameters of jl. Indeed, up to now 
these two steps were carried out in a conventional way, leaving aside the problem 
of properly incorporating prior knowledge, in particular with respect to the choice 
of jl. A first contribution highlighting the different clustering behaviors induced 
by alternative random measures in the context of mixtures for Bayesian density 
estimation is provided in Lijoi, Mena and Priinster (2005). See also Ishwaran and 
James (2001). 

The paper is structured as follows. In Section 2 we introduce some basic concepts 
and notations. In Section 3 we state the main results concerning linear and quadratic 
functionals of random hazard rates. In particular, we derive CLTs for the cumulative 
hazard function and for the path-second moment and path-variance of the hazard 
rate. Moreover, we provide a useful comparison theorem which allows to bypass 
the verification of the most delicate conditions thus leading to obtain CLTs for 
hazard rates based on complex kernels or random measures. Section 4 is devoted 
to applications: we consider specific models involving kernels and random measures 
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commonly exploited in practice and analyze their asymptotic behavior in detail. In 
Section 5 the proofs of our results are provided and the techniques used to establish 
them are illustrated. Section 5 contains some concluding remarks together with 
possible extensions and an outline of future work. 



2 Basic concepts and notations 

We start by introducing the main concepts and notations employed throughout the 
paper. Consider a measure space (X, where X is a complete and separable metric 
space and ^ is the usual Borel cr-field. Introduce a Poisson random measure N, 
defined on some probability space (fi,^,P) and taking values in the set of non- 
negative counting measures on (IR,+ x X, ^(]R+) with non-atomic intensity- 
measure v, i.e. 



E 



N{dv,dx) = ^{dvjdx) 



and, for any A G =^(]R+) (g) such that i^iA) < oo, N{A) is a Poisson random 
variable of parameter i^{A). Moreover, given any finite collection of pairwise dis- 
joint sets, Ai, . . . , Ak, in ^{'R'^) (gi the random variables N{Ai), . . . , N{Ak) are 
mutually independent. Throughout the paper, E[ • ] will denote expectation with 
respect to P. Moreover, the intensity measure v must satisfy 

/ A l)z^(di;,X) < oo 

where aAb = min{a, b}. See Daley and Vere-Jones (1988) for an exhaustive account 
on Poisson random measures. 

Recall that, according e.g. to Daley and Vere-Jones (1988), a Borel measure /x on 
some Polish space endowed with the Borel cr-algebra is said to be boundedly Gnite if 
n{A) < +00 for every bounded measurable set A. Let now (M, ^(M)) be the space 
of boundedly finite measures on (X, ^(X)). We suppose that M is equipped with 
the topology of weak convergence and that ^(M) is the corresponding Borel cr-field. 
Let /i be a random element, defined on (O,^, P) and with values in (M, ^(M)), 
which can be represented as a linear functional of the Poisson random measure N 
(with intensity u) as follows 

fiiB)= [ sN{ds,dx) VSg^(X). 
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It can be easily deduced from the properties of N that jl is, in the terminology of 
Kingman (1967), a completely random measure (CRM) on X, i.e. 

(i) /i(0) = a.s.-P 

(ii) for any collection of disjoint sets in =^(X), Bi, B2, ■ ■ ., the random variables 
fl{Bi), p:{B2), ■ ■ ■ are mutually independent and p,{Uj>iBj) = J2j>if^{^j) 
holds true a.s.-P. 

Now let be the space of functions g : X ^ R+ such that /]a+xx[l " e'^^^^^] 
z^(ds, dx) < 00. Then, the law of fl is uniquely characterized by its Laplace functional 
which, for any g in '^u, is given by 



For details and further references on CRMs see Kingman (1993). From ([5]) it is 
apparent that the law of the CRM fl is completely determined by the corresponding 
intensity measure 1^. This suggests a simple and useful distinction of the random 
measures we deal with, according to the decomposition of 1^. Letting A be a non- 
atomic and cT-finite measure on X, we have: 

(a) if z^(d?;,dx) = p{dv) A(dx), for some measure p on R"*", we say that the corre- 
sponding N and p, are homogeneous; 

(b) if i^{dv,dx) = p{dv\x) X{dx) , where p : ^(R"^) x X ^ 11+ is a kernel (i.e. 
X I— > p{C\x) is ^(X)-measurable for any C £ ,^(11'^) and p{ ■ \x) is a cr-finite 
measure on ^(R+) for any x in X), we say that the corresponding random 
measures and p are non-homogeneous. 

In the sequel we consider CRM p whose intensity measures satisfy 



In the homogeneous case, ()Hip reduces to max{ p(R+) ; A(X) } = +00, which is 
tantamount of requiring either infinite activity of p i.e. p jumping infinitely often 
on any bounded ^ G ^ or to consider p with unbounded support S such that 
A (S) = +00. In the non-homogeneous case, for (|Hip to hold it is enough that p 
jumps infinitely often on some bounded set of positive A~measure. It is clear that 






(Hi) 
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()Hip is met by the CRM usually considered in the literature. In the subsequent 
sections, as illustrations of our general results, we will consider the following CRMs: 



1. Generalized gamma CRM: its intensity measure is homogeneous and given by 

1 e~'^^ 

where a G (0, 1) and 7 > 0. This class, studied in Brix (1999), can be charac- 
terized as the tilted exponential family generated by the positive stable laws. 
It includes the inverse Gaussian CRM for a = 1/2 and the gamma CRM as 

2. Extended gamma CRM: its non-homogeneous intensity measure is of the form 

^{dv, dx) = dv A(dx) (7) 

where /? is a strictly positive function on X. This class dates back to Dykstra 
and Laud (1981) and Lo and Weng (1989). The gamma CRM arises if /? is 
constant. 

3. Beta CRM: its non-homogeneous intensity measure is given by 

u{dv, dx) = 1(0,1) (v) c{x) ~ ^ d^; A(dx) (8) 

where c is some strictly positive function on X and Ia stands for the indicator 
function of set A. Note that the class of beta CRM, which is due to Hjort 
(1990), has the particularity of allowing only jumps of sizes less than 1. 

Having settled the basics regarding the background driving CRM in ([T]), we now 
have to define the kernel: A: is a jointly measurable application from 1R,+ x X to 111+ , 
such that Jj^k{t, x)X{dx) < +00 and /. k(t\x)dt is a c-finite measure on ^{H'^) for 
any x in X. Given these two ingredients the random hazard rate in ([1]) is properly 
defined. 

A further technical assumption we will make throughout the paper is represented 
by the following conditions 

/ k {t, xy p{dv\x) X{dx) < +00 yt, j = 1, 2, 4; (H2) 
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/ / k{t,xy p{dv\x)X{dx)dt <+oo VT > 0, j = 1,2,4. 

J[0,T] Jr+xX 

If, for j = 1,2,4, the apphcation x ^ J^+v^p{dv\x) is bounded by some finite 
constant (which is typicahy the case), then the first condition in (jH2|) reduces to 
requiring that the function x ^ k{t, xY is integrable with respect to A for every t, 
whereas the second fine of ()H2p boils down to the assumption that the apphcation 
{t,x) ^ k{t,x) is an element of □^=1^2,4-^-' ([0, T] x X, dtA(d2;)) for every T > 0. 
Hence, in the uniformly bounded case ()H2p is a condition not involving the CRM, 
but just the kernel. Moreover, it is easy to see that the quantity p{dv\x) , 

j = 1,2,4, is bounded in x whenever p{dv\x) is associated to one of the three 
classes of CRMs defined above (see ([U]), d?]) and ([5])). We shall also note that, in the 
homogeneous case, ()H2p implies that p{dv) < +00, j = 1,2,4. An example 

of a homogeneous CRM, which does not meet (|H2p is the stable CRM for which 
p{dv) = v-^-'^dv and a G (0,1). Note that the stable CRM can be recovered 
from the generalized gamma class by allowing 7 = in ([6]): we have excluded this 
possibility since it does not meet ()H2p . 



2.1 Further notation 

For q,p > 1, we note 

LP {u") = LP ((R+ X X)", {^(R+) jr)9, u") 

the Banach space of real-valued functions / on (1R+ x X)'^, such that |/|^ is integrable 
with respect to u'^ := v®'^. We will systematically write [v^^^ = {v) for p>l. 
The symbol (z^^) is used to denote the subspace of (i/^) composed of symmetric 
functions on (R^ x X)^. By symmetric, we mean that every f £ (z^'^) is such that 
/ (s, x; t,y) = f (t, y; s, x) for every (s, x) , {t, y) G x X. We also write Li q (z^^) 
to indicate the subset of (z^^) composed of symmetric functions vanishing on 
diagonals, i.e. such that their support is contained in the purely non-diagonal set 
Dl = {{s,x;t,y) : {s,x) / {t,y)}. 

We now turn to the definition of three basic auxiliary kernels which are associated 
to a given / G Lg (i^^): (i) the kernel f -k^ f is defined on (1R+ x X)^ and is given by 

f f iti,xi;t2,X2;ts,xs) = f {ti,xi;t2,X2) f {t2,X2;t^,x^) ; (9) 
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(ii) f •k\ f is defined on (R^ x X)^ and is actually a contraction equal to 



f*i f {h,xi;t2,X2) 



R+x3 



f iti,xi;s, y) f (s, y; t2,X2) v (ds, dy) ; (10) 



(iii) / *2 / is defined on (I[l+ x X) and is given by 

f*lf{t,x)= f {t,x;s,yf u{ds,dy) . 

JR+XX 



(11) 



Note that, by the Cauchy-Schwarz inequality and by the symmetry and square- 
integrability of /, the kernel / / is necessarily an element of (z^'^)- The three 
kernels defined above are the fundamental building blocks to obtain explicit expres- 
sions for the moments and the cumulants of the linear and quadratic functionals 
associated with random hazard rates (when they exist). Such expressions enter 
implicitly in the statements of the subsequent results, and are mainly of a combina- 
torial nature. We refer the reader to Rota and Wallstrom (1997) for an exhaustive 
analysis of the combinatorial machinery underlying the construction of stochastic 
integrals with respect to completely random measures. 

In the subsequent sections it will be often convenient to work with the com- 
pensated Poisson random measure canonically associated to N. Such an object is 
indicated by 

iV^ = |iV'=(A) : ^ G ^(R+) jrj , (12) 
and is formally defined as the unique CRM on (R+ x X, ^(R"*") J^T) such that 



iV^ (A) =N{A)-u (A) 
for every set A of finite z/- measure. For every g ^ (v), we denote by 



(13) 



N^g) 



R+xl 



g (s,x) iV^ (ds,dx) 



the Wiener-Ito integral of g with respect to N'^. We recall that, for every g £ L"^ (u), 
(g) is a centered and square integrable random variable with an infinitely divisible 
law, such that, for every A E M, 



E 



exp 



R+xX 



JXg{s,x) _ l_iXg(^s,x) \ 1/ (ds, dx) 



(14) 



(compare with dS])). Moreover, for every f,g £ L'^ (v), one has the isometric property 



E 



N^f)N^g) 



R+xl 



f {s, x) g [s, x) u {ds, dx) := (/, g) ^2(^,) 



(15) 
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Note that ([5]), ()14p and the isometric property (jlSp imply that, for every g £ L'^ 



E 



Var 



Nig) 



R+x] 



5 (s, x) z/ (ds, dx) 



(16) 



Var 



N'^ig) = / (7(s,x)'^^(ds,dx). (17) 



R+xX 



3 Main results: CLTs for linear and quadratic function- 
als 

In what fohows, we shall develop several techniques, allowing to study the asymptotic 
behavior of linear and quadratic functionals associated to the random hazard rate 
h{t) appearing in ([1]). Concerning quadratic functionals, we will be mainly interested 
in the path- variance and the path second moment of h(t). As will be clarified 
in Section \5\ our approach exploits the fact that any quadratic functional of h 
can be (uniquely) represented as a linear combination of its expectation and of 
the following two random elements: (i) the stochastic integral of a deterministic 
kernel with respect to A^'^, and (ii) the double Wiener- Ito integral of a deterministic 
bivariate kernel with respect to the stochastic product measure associated to N^. 
According to the results proved in Peccati and Taqqu (2006b) (see Section [5TT]) . the 
joint (weak) convergence of single and double Poisson integrals can be characterized 
in terms of the asymptotic negligibility of deterministic contraction kernels. We will 
show that such contractions are indeed explicit functionals of the kernel k defining 
h. We shall first state the main general results of the paper, and then describe in 
detail several applications. The proofs are deferred to Section [5l 

Consider the random hazard rate h defined in formula ([T|), and assume that 
the intensity of the underlying Poisson CRM A^ verifies ()Hip . and that the positive 
kernel k satisfies ()H2p . Moreover, for every T > define the kernel 

nT 

kP{s,x) = s k{t,x)dt, (s,x)g1R+xX. (18) 
Jo 

Our first result concerns the asymptotic behavior of the cumulative hazard rate 
H{T) = J^h{t)dt. 
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Theorem 1 Suppose that: (i) kip G iy) for every T, and (ii) there exists a 
strictly positive function T ^ Cq {k, T), such that, as T 



Cl{k,T) X 
Cl{k,T) X 



/ 


t\irp 


(s,x) 


/R+xX 






/ 


r\,rp 


{s,x) 


/R+xX 







Then, 



law 



> +00, 

al (k) > 0, 
0. 

X, 



(19) 
(20) 

(21) 



Co{k,T) X [h{T)--E[H{T)] 

where X ~ ^ (O, (A;)) 

Note that conditions (|19p -(|20 p only involve the analytic form of the kernel k, 
and do not make any use the of the asymptotic properties of the law of the process 
h{t), such as e.g. mixing. We now focus on the limiting behavior of the quadratic 
functionals associated to the random hazard rate h. To this end, we associate to 
k (•, •), and to each T > 0, the three auxiliary kernels: 



kj^ ('S; x; t, y) 

.(2) 



St 



T 



— I k{u,x)k{u,y)du, 



k 



S 

T 



k^ {s, x) 



R+xX 



k {u, x)^ dii, 

A;^^^ (s, x; u, w) v (dn, Aw) . 



(22) 

(23) 
(24) 



The kernel k^ can be obtained by restricting k)j,^ to the diagonal set {{s,x;t,y) : 
{s,x) = {t,y)}. We will see in Section [S] that the kernels fc^'' are intimately related 
to the objects defined in formulae ([9])- pT]) . Note that, due to assumption (|H2p 
and the Jensen and Cauchy-Schwarz inequalities, k!^^ £ (i/^) n (z^^) , and also 



(1) 



of random hazard rates. 



k}p' G (v). The following theorem provides a CLT for the path-second moment 



Theorem 2 Suppose that k^ G {v) n {v), k^ G {v) and that there exists 
a strictly positive function Ci (k, T) such that the following asymptotic conditions 
are satisfied as T ^ +oo: 



1. 2Cf{k,T) 

2. Ctik,T) 



.(1) 



Xirp 



2 

L2(l/2) 

4 

L4(;/2) 



al (k) > 0; 



0; 



11 



3. Cfik,T) 

4. Cfik,T) 

5. C!{k,T) 

6. CUk,T) 
Then, 



,1 1.(1) 



.(1) ,1 ^.(1) 



k^? + 2kP 



k^? + 2^5?) 



1 



2 

L2(ly2) 
2 

2 

3 



0; 

> <j2 {k) > 0; 

> 0. 

1 



Ci (fe, T) X ^ / /i(t)^dt - ^ I B[hmdt } ^ X 



(25) 



where X ~ ^ (O, a? (A;) + cr| (A;)) 
Note that 

.(3) 



T 



/ / 4')(„;„.„).(d„,d.).(d..d.) 

Jr+xX JR+xX 



1 



sk{t,x) V {ds,dx) \ dt. 



T Jo \Jn+xX 

Also, by applying formulae (jl6p and (jl7p (for every t > 0) in the case h{s,x) 
sk{t,x), one obtains that 



T 



E[hitf]dt 



T 



\JR+xX 



s/c (f, x) 1/ (ds, dx) ) df 



(26) 



1 




+- I I s^k{t,xy u{ds,dx)dt. 
T Jo Jr+xX 

The next theorem combines Theorem [1] and Theorem [2] to deal with path- 
variances of random hazard rates. 



Theorem 3 Suppose that h is such that assumptions (|19p - ()20p are verified, and 
conditions 1.-6. of Theorem^ are satisfied. If there exists a constant 5 (k) > such 
that, as T ^ +oo, 

1. Ciik,T)/iTCoik,T)f ^0; 

2. 2Ci {k, T) nH{T)]/ {T^Co {k, T)) 6 (k) ; 



Ci {k, T) ( + 2^^ Co {k, T) 4" 



.(0) 



4 (k) > 0, 
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then, 



■■Ci{k,T)x 



hit) 



hit) 



H{T) 



H{T) 



(27) 



1 r 



hit) 



miT)) 



dt 



law 



X, 



where X ~ ^ (O, (A;) + (A;)) . 
In view of (jl7p . one also has that 

i r Yar (hit)) dt = ^ 



s^k it, x)^ (ds, dx) dt. 

^R+xX 

To conclude this subsection, we state a useful comparison theorem for random hazard 
rates. To this end, consider two completely random Poisson measures (on x X) 
N and A^, as well as positive kernels k and k. The cr-finite and non-atomic intensity 
measures of N and N are denoted by z7 and z7, respectively. We assume that z7 and V 
both verify (jHip . and that k and A; satisfy ()H2|) . Finally, we suppose that, for every 
S G (<^(R+) JT), 

I7(i?) <^(i?), 

and, for every (t, x) G x X, 

A; (t, x) < k it, x) . 

Throughout the paper, for strictly positive sequences {an} and {bn}, we write a„ ~ 
6„ if there exists c E (0, -|-(X)) such that an/bn — > c, as n ^ oo. 

Theorem 4 Suppose that the pair iv, k) entering the definition of the random haz- 
ard h in ^ is such that, for every B e (=^(E+) ® .^), V iB) < v (B) < V (B) and, 
for every it,x) G x X, A; (t, x) < A;(t,x) < kit,x). Then, the following three 
comparison criteria hold. 

(A) Assume that the two kernels k and k, with v andV substituting v, satisfy the 
conditions (fTU|) - ([2n|) for some appropriate positive functions Co(A;,T) and Coik,T) 
and constants Cq(A;) and crQ(A;). Suppose also that Co(A;,T) ~ Co(A;,T), and consider 
a positive function Co(A;, T) such that Co(A;, T) ~ Co(A;, T). Then, for every diverging 
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sequence Tn — > +00, there exists a subsequence Tn' such that the CLT (|2ip holds as 
n' — > +00, with Tn' substituting T , where X is a centered Gaussian random variable 
whose variance depends on the choice of Cq {k,T) and on n' . 

(B) Assume that k and k, withV andV substituting v, satisfy conditions 1.-6. of 
Theorem^ for some positive functions Ci{k,T) and Ci{k,T) and constants (7j{k) 
and crj{k), j = 1,2. Assume, moreover, that Ci{k,T) ~ Ci{k,T), and select a 
positive function Ci {k,T) such that Ci {k,T) ~ Ci{k,T). Then, for every sequence 
Tn — > +CXD, there exists a subsequence T„/ such that the CLT (j25p is verified (for 
n' — > +00 and with T„/ substituting T) where X is a centered Gaussian random 
variable whose variance depends on Ci {k,T) and n' . 

(C) Suppose that k, k, Cj{k,T), Cj{k,T) and Cj{k,T) (j = 0,1) satisfy the 
assumptions pinpointed in Parts (A) and (B), and suppose that they also meet the 
Conditions 1.-3. of Theorem^ Then, for every sequence Tn — > +cxd, there exists a 
subsequence Tn' such that the CLT (j27|) holds, for n' +00 and with Tn' substituting 
T. 

Remark. The conclusions of Theorem H] are less precise than those of Theorems 
(THSl in the sense that they only apply to subsequences T„/. Of course, this is due to 
the fact that, in the statement of Theorem [U we do not make any assumption on the 
analytic properties of k and z^, besides the conditions k < k < k and V < ly <V. As 
will become clear in the subsequent sections, more exact information can be deduced 
by adding some specific requirements to the structure of k and i^. 

4 Applications 

We will now consider noteworthy examples of random hazard rates by specifying 
suitable kernels and the form of the background driving CRM. In the following 
we will always consider CRMs with A being the Lebesgue measure on 11+ , which 
appears a natural choice in our context. This implies that Assumption (HI) is met. 
Paragraph 14.11 is devoted to the study of the asymptotic behavior of the cumulative 
hazard H, whereas in Paragraph 14.21 we deal with quadratical functionals of the 
hazard rate. 
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4.1 Asymptotics for the cumulative hazard 

As an illustration of Theorem [H we consider different kernels and show how they 
are responsible for the rate of divergence of the cumulative hazard and how they 
influence the variance of the limiting Gaussian random variable in the CLT (12ip . 
We first consider general homogeneous CRM such that /[^ v'^p{dv) < oo, which is 
tantamount of requiring the part of condition (H2) involving the jump component 
of the Poisson intensity to be satisfied. Moreover, set, for notational convenience, 
^ s'^p{ds), i = 1,2, and /j = Ii{T) = /j^+^x {s,x) i^(ds,dx) for 



i = 1,2,3. Note that h (T) = E[H{T)]. 

(i) Rectangular kernel. The kernel k{t,x) = l(^\t_x\<T) where r > represents 
a bandwidth, is known as uniform rectangular kernel. Such a kernel represents a 
sensible choice when no prior information on the shape of the hazard rate is available. 
See, e.g., Ishwaran and James (2004). In this setup (H2) is clearly met. 



s{x + t) 
s2t 

S[T + T 





X 



< X < T 

T < X < T — T 

T-T <X< T+T 

elsewhere 



and kP{s,x) G L^{u 



for all T > 0. We also have, as T — > +oo, Ii (T) = 
Kj^^ {2Tt - ir2} = 2tKI,^^T + o [T^/^], h (T) ~ AxfK'^T and h {T) ~ cT for 
some c > 0. Hence, dUD and ^ are satisfied with Co{k,T) = T^^/^ and, by 
Theorem [H we obtain 

1 



H{T) - 2tK^^^T 



law 



X, 



(28) 



where X ~ ^ ( 0,4^^^^^ 



(n) Dykstra-Laud kernel. If k {t, x) 



I 



iO<x<t) 



, then the random hazard rate is 



monotone increasing. Such a kernel, which is widely exploited in practice, was first 
proposed in Dykstra and Laud (1981). It is easy to see that (H2) is satisfied and 
that k!^\s,x) = s(T — x)I(o<a;<T) ^ ^^{^) fo'^ ^-ll T > 0. Moreover, one obtains 



K 



(2) 



and I3 = ^T^, so that ([ISD and ([20]) are met with 



(2) 



3 " """" "■^ 4 

Co{k,T) = r-3/2. Hence, by TheoremH we have 



1 

¥1 



H{T) 



K, 



(1) 



law 



X, 



(29) 
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where X ~ ^ I 0, — ^ j . Note that the Dykstra-Laud cumulative hazard has a 
quadratic asymptotic trend, whereas the trend obtained from a rectangular kernel is 
linear. Moreover, the speed at which the Dykstra-Laud cumulative hazard diverges 
from its trend is significantly faster than in the rectangular case. The reason may 
be that the former produces monotone increasing hazard rates whereas the latter 
not. This phenomenon, well exemplified by our result, should be taken into account 
when deciding which kernel to adopt. 

(in) Ornstein-Uhlenbeck kernel. If k {t, x) = -v/2Kexp (— k [t — x)) I(o<2:<t)) then the 
random hazard rate is an Ornstein-Uhlenbeck-type process. Such models for the 
hazard rate are employed in Nieto-Barajas and Walker (2004, 2005). In this case, 
(H2) is met, k!^\s,x) = Sy^il - e-''^^-'''>)\o<x<T) e L^{u) for all T > 0, and 
we have that, as T diverges to infinity, Ii (T) = Kp^^ -s/I/k {T — e^'^ / k + = 
K^^^y^2/^T + o (T^/^^, I2 (T) ~ and h (T) ~ cT for some constant c> 0. 

Hence, (|19p and (j20p are satisfied with Co{k,T) = T^^/^. From Theorem[T]it follows 
that 



1 



^ X, (30) 



where X ~ ^ |^0, =^ j . One may note that the trend and the rate of divergence 
from the trend associated with the Ornstein-Uhlenbeck kernel coincide with those 
arising from the rectangular kernel. Moreover, given the same background driving 
CRM, the variances of the limiting Gaussian random variables appearing in (j28p 
and (I30p coincide if the parameters are chosen in such a way that K = 1/(2t2). 

(iv) U-shaped or hath-tube kernel. If k{t,x) = I(^^t-f3\>x) with /3 > 0, then the 
corresponding hazard rates are U-shaped with minimum at f3. Such a kernel is 
suggested by Lo and Wong (1989). See also James (2003) and Ishwaran and James 
(2004). It is easy to check that (H2) is met. 



s{T -2x) <x < f3 

s[T-{i3 + x)] (3<x<T-/3 
elsewhere 



and kP{s,x) G L^{u) for all T > 0. Moreover, as T ^ +00, h {T) = ^K^p^T"^ + 
o (T^/2)^ I2 ~ — and /s ~ cT^ for some constant c > 0. Choosing Co{k,T) = 
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T pop and (j20p are satisfied and from Tlieorem [T] we deduce 



ri 



^ X, (31) 



where X ~ ^0, J • Note that the bath-tube kernel produces the same asymp- 
totic behaviour of the Dykstra and Laud kerneh this fact is not surprising since after 
reaching its minimum in /3, also the bath-tube kernel is monotone increasing. Of 
course, one can regard the Dykstra and Laud kernel as a degenerate bath-tube 
kernel, corresponding to the case /? = 0. 

As apparent from the statement of Theorem [1] and from the discussion provided 
above, the variances of the limiting Gaussian random variables appearing in (j2ip . 
([28]) . (p9]) . ([30j) and ([3T]) . always depend on the jump part of the Poisson intensity. 

.(2 



(2\ 

For instance, if fi is the generalized gamma CRM with intensity ®, then Ky' 



^^^1-} ■ This confirms the empirical finding, used in tuning the prior parameters, that 
a small 7 induces a large variance. To avoid confusion, note that in the setting of 
e.g. Ishwaran and James (2004) /3 = I/7 and, hence, their claim that a large /? 
induces a non-informative prior is coherent with our result. As for a, the variance 
is maximal in cr = if 7 < e, whereas it is maximized in cr = (log(7) — l)/log(7) if 
7 > e. 

Let us now turn attention to hazards based on non-homogeneous CRM, specif- 
ically the extended gamma and beta CRMs presented in Section 2. From ([7]) and 
([8]) one can see that their non-homogeneity is due to the strictly positive functions 
P and c, respectively. According to their structure we distinguish three cases: (a) 
if /3(x) = /3 in d?]) and c(x) = c in ([8]), the CRMs become homogeneous and the 
previous results hold with Kp^^ equal to 1//?^ and 1/(1 + c), respectively, (b) If f3 
(or c) are bounded by some finite constant M, then one can apply Theorem 4 to 
conclude that Co{k,T) has the same order as in the examples above, thus depend- 
ing on the choice of the kernel. Moreover, if /3 (or c) are eventually non-decreasing 
(non-increasing) the convergence holds for any diverging sequence with the vari- 
ance of the limiting Gaussian random variable depending on the choice of /3 (or c) 
taking value in the range [cJq(/c), crQ(A;)]. (c) If (3 (or c) diverge to -|-oo as x — > -|-oo, 
quite interesting phenomena appear, which shed some light on the possible use of the 
factor of non-homogeneity represented by the functions /3 (or c). Set, for i = 1,2,3, 
Kp\x) = s^p{ds\x), so that /j becomes f-^Kp'\x) k{t,x)dt dx. For both 
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(2) 

CRMs, a diverging /3 (or c) implies that Kp (x) 0: this, indeed, affects the 
asymptotic behavior of the cumulative hazard H. To be more specific, consider 
the Dykstra and Laud kernel combined with an extended gamma CRM such that 
/3(x) ~ ^/x as X ^ oo: it follows that I2 ~ log(T)T^ and ~ dT"^ for some constant 



H{T)-E[H{T)] ^X, (32) 



d> 0. Hence, ([HD and ([20]) are satisfied with Co{k,T) = (y^log(r) T)-^ and, by 
Theorem [H we have 

1 

where X ~ (0,1). Comparing (j32p with ()29p one notes that the rate of di- 
vergence from the trend E[F(r)] is reduced from T^/^ to y^log(r) T. As for 
E[iif(T)], it is important to remark that the overall growth (though not the dom- 
inating term which is 4/3 T'^/'^) depends on the particular form of (3. Still as- 
suming l3{x) ~ y/x and letting 6 be a positive constant, we obtain, for instance, 
E[F(r)] = 4/3T3/2 + o{Ty^h^) when (3{x) = 1(0,6] (x) + x^/%,oo){x), and 

E[^(r)] =4/3r3/2-iog(r)T + o(ryk^^(r)) a p{x) = (i + x^/^). a gain, compar- 
ing these findings with (j29p it is apparent that the trend has been reduced from 
to T^/^ + o(r^/2). On the other hand, with the beta CRM, we have K^^\x) = 1 
and, consequently, Ii{T) = E[i^(T)] = 1/2 whatever the choice of c. Selecting 
c{x) ~ ^/x as X ^ 00, we obtain I2 ~ 16/15 T^/^ and ~ dlog(T)T'^ for some 
constant d > 0. Thus, with Co{k,T) = T~^/^^ ^ and ([20]) are met and Theorem 
[U yields 

\\h{t)-\tA ^x, 

Til ^ . 

where X ~ ^ (0, 16/15). Hence, compared with the homogeneous case in ()29p . the 
beta CRM does not affect the trend but still decreases the rate of divergence from 

r2 to r5/4. 

If, instead, we consider the rectangular kernel with r = 1 combined with an extended 
gamma CRM such that again P{x) ~ ^/x as x ^ 00, it follows that I2 ~ 41og(T) 
and ^ d for some constant d > 0. Hence, (|19p and (j20p are satisfied with 
Co{k,T) = (y/log(r))-i and, by TheoremlH we have 

1 

where X ~ ^ (0,4). Hence, we see that the rate of divergence from E[^(T)] has 
been reduced with respect to the homogeneous case in (I28p decreasing from T^/"^ 



H{T) - E[^(r)] 



law 

> A, 



18 



to y^log(T). As before, Ii{T) = 'E[H{T)] depends on the particular form of /?. 
With f3{x) ~ ^/x, b being a positive constant, we have /i(r) = 4r^/^ + o{^ylog{T)) 
if (3{x) = 1(0,6] (x) + x^/\,^^){x) and h{T) = AT^^ - 21og(T) + o(7l^i(r)) if 
f3{x) = (l+x^/^). By comparing these trends with the one in (|28p one can appreciate 
its reduction from T to T^/^ + ©(T^/^). 

Replacing the extended gamma CRM with a beta process we have Ii{T) = 
2T — 1/2 whatever the choice of c. Moreover, if c{x) ~ ^/x as x ^ oo we obtain 
I2 ~ 8ri/2 and /a ~ dlog(r) for some d > 0. By setting Co{k,T) = T'^/^ ^9i) and 
(|20|) are met and Theorem [J leads to 



H{T) -2T 



law 

> A, 



where X ~ ^(0,8). Hence, with respect to ([28]) . the trend is unchanged and the 
rate of divergence halved. 

By means of the previous examples the impact of a non-homogeneous CRM be- 
comes apparent: a non-homogeneous CRM allows to reduce both the trend of the 
cumulative hazard and the rate at which it diverges from its trend. An extended 
gamma CRM is able to reduce both, whereas a beta CRM affects only the rate of 
divergence from the trend. Overall, by studying also other examples, not reported 
here, of functions f3 and c with the 4 different kernels considered above, some in- 
teresting indications can be drawn. For instance, denote by T"^ the rate at which 
the cumulative hazard based on the homogeneous version of an extended gamma 
(or beta) CRM diverges from its trend (e.g. t] = 3/2 in the Dykstra-Laud case). 
Then, by choosing a suitable diverging (3 (or c) the rate can be tuned at any order 
in the range [T^'-i/^T'?]. Analo gous conclusions can be derived for the trend when 
using a hazard based on an extended gamma CRM: the trend corresponding to the 
homogeneous case T" (e.g. a = 2 for the Dykstra-Laud kernel) can be tuned by 
the choice of (3 at any rate in the range [T°'~^ jT"]. 



4.2 Asymptotics for quadratic functionals 

In this paragraph we consider quadratic functionals of the random hazard rate. We 
derive central limit theorems for the path-second moments and the path-variances 
of hazard rates with specific kernels and driving CRM. Our results will be mainly 
based on Theorems [2] and [3l As in the previous paragraph, we first deal with general 
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homogeneous CRM such that /j^ v^p{dv) < oo; this requirement combined with 
the structure of kernels we consider ensures that (H2) is satisfied. Finahy set, as 
before, K^,'^ = sV(ds), for i = 1, 2, 3, 4. 

(i) Rectangular kernel. We start by considering the rectangular kernel and derive 
CLTs for the path-second moment and for the path-variance of hazard rates. Some 
simple calculations lead to write, for x > y and T > 2t, 



k!^\s,x;t,y) = < 



Tiu + T) y<x<T,0<y<T 

§{y + 2T-x) (r Vy) < X < (y + 2t),0 < y < T-r 

^[T + T-x] y<x<T + T, T-T<y<T + T 







elsewhere 



Moreover, k!^\s,x) = sT ^k^'{s,x) and for T > 2t, one has 



-u(o) 



sK, 



(1) 



sK, 



(1) 



[ix^ + Tx] 

[-ix2 + 2rx] 



sK, 



(1) 



^ [-ir2+T(x + T) + |T2-rx-i: 







< X < r 

T < X < 2r 

2t <x <T -T 

T -T <X <T + T 

elsewhere 



In order to apply Theorem [2] let us first consider Condition 1., which allows to 
determine the rate function: it turns out that Ci (A;, T) = VT since 



2T 



(1) 



16 t3 (Kf))2 



(33) 



The verification of Conditions 2.-6. can be achieved by simple though quite lengthy 
calculations. 

Indeed, letting, for i = 1, . . . , 4, be a positive constant, one obtains 



2. 
3. 
4. 

5. T 



di 

T 

2 

L2{l/2) 
2 







^ ^ 



d2 

J 

T 
T 



lSp+2k^ 



alik) = 16t2 



(4) 2' 
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6. 

Since 



4^) + 24^) 



nh{tf)dt = 2rKf)+^r' {k^^))\o{t-'''), (34) 

we deduce from Theorem [2] the fohowing asymptotic result, concerning the path- 
second moment of h{t): 

I ^ hitf^t - {2r Kf) + 4 (4^)) } X, 

where X ~ ^ (O, cif (/c) + al{k)) with 



(4) 



T K, 



(2) 



Now we concentrate on a CLT involving the path-variance of h{t), that we shall 
obtain as an application of Theorem[3j In particular, we must verify that Conditions 
1, 2 and 3 in the statement of such result are verified, for some appropriate positive 
constants 5 {k) and (t| {k). Indeed, one has that, as T — > +00, 

Ci{k,T) 



{TCoik,T)Y 



^ 



(35) 



2Ci {k,T) 
T^Coik,T) 



E 



[H{T)] = I {2rK«T + o(r)} ^ 4rK« := 5 (k) (36) 



and also 



Ci (k, T) (kP + 2^^) - 6 (k) Co {k, T) k 



(0) 
T 



2 



16t" 



(4) 



ai(A;). (37) 



The fact that E[^(r)] = K^fP {2Tt - ^r^} combined with §^ yields 



E 



hit) 



miT)] 



Hence, by using ([35|) - ([37|) . we deduce from Theorem [3] that 
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where X ~ ^ (O, af (k) + (t| (A:)) , and af (k) and erf (A;) are given by ([MI) and (fST]) . 
respectively. 

(a) Ornstein-Uhlenbeck kernel. Let us now derive the CLT for the path-second 
moment and the path-variance of hazards based on the Ornstein-Uhlenbeck kernel. 
For this case we easily obtain 



k!^^ {s, x; t, y) = ^e«(-+^) (e'^^ - e'^^^) I(o<,<.<t) 



T 

-.2 



K J. 



(0<a:<T) 



and some tedious algebra allows to derive also /c^^ k!^^ and kip^ *2 ^ ■ Condition 



(38) 



1. in Theorem [5] is verified by choosing Ci {k,T) = yf: indeed, 



2T 



,(1) 

T 



ai{k) 



Standard calculations allow to verify the validity of the other conditions in the 
statement of Theorem [2l In particular, by letting di {i = 1,...,4) be a positive 
constant, one obtains 

4 di 



(1) 



2. T 

3. T 

4. t- 

5. T 

6. T 
Since, as T ^ +oo. 



IXirj-^ A -J^ l\/rj-} 



4^) + 



^ ^ 

~ > u 

L2H T 
3 



d^ 

L3{u) ^ rV2 







T 2 fi^^^^V 



(39) 



we deduce from Theorem [2] the following result for the path-second moment: 

. rT 2 ( K).'> ■ ' 



1 / Htfdt 



i^(2) 
P 



law 



X, 
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where X ^ ^ (^0, /^i^^ + + + . As far as the 

path-variance is concerned, one verifies easily that the conditions of Theorem [3] are 
verified, with 6 {k) = "^^K^p^ and 

al (k) := - ^ Kf) + 1 Kf^ «)) ' (40) 

ri ri \ / 



Using (|39p . it is straightforward to see that 

^ f E /,(t)-M^ 'd^ = K(2)+o(T-l/2^ 







As a consequence, we deduce from Theorem [3] that 

with X ~ ^ (0, erf (/c) + o-| {k)) , where uf (/c) and (t| (A;) are given by ^ and (|iO]) . 
respectively. 

Before considering the Dykstra and Laud kernel and the U-shaped kernel, let us 
make the previous results completely explicit by specifying the background driving 
CRM. For both the rectangular and the Ornstein-Uhlenbeck kernel the rate function 
is the same and the CRM affects the variance of the limiting Gaussian random 
variable for both path-second moment and path-variance of the hazard rate. Take, 
as before the generalized gamma CRM with Poisson intensity ([6]) and denote the 
Pochhammer symbol by (a)„ := T{a + n)/T{a). For this choice we have K^"^ = 
[(1 — a)c-i\{'l^~")~^ for any c > 0. For the Ornstein-Uhlenbeck kernel the variance 
is then given by 

which decreases as k and 7 increase for any given (7,0") and (k, o"), respectively. 
Moreover, it is maximized by a = for low values of k and 7, whereas, for moderately 
large values of k and 7, the maximizing cr increases as k and 7 increase. For instance, 
if K = 0.5 and 7 = 2, the maximizing cr is approximately equal to 0.22 and the overall 
variance is 2.56. To highlight the incidence of the prior parameters note that with 
K = 1 and 7 = 3, the maximizing a and the variance are approximately equal to 0.52 
and 0.29, respectively. Using the asymptotic variance as a guideline for fixing the 
prior parameters seems a sensible and straightforward choice since it summarizes 
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in a single expression the various effects of the parameters. Turning to the path- 
variance a hazard based on a generahzed gamma CRM with Ornstein-Uhlenbeck 
kernel will have variance given by 



which behaves in the same way as (141 p but, obviously, leads to smaller values. 
Considering the same set of parameters as above we have: if k = 0.5 and 7 = 2, 
a ~ 0.61 maximizes (I42p and its value is 0.92; if k = 1 and 7 = 3, (I42p is maximized 
by cr ~ 0.76 leading to a variance of 0.09. Similar considerations hold also for the 
asymptotic variance of a hazard based on the rectangular kernel combined with a 
generalized gamma CRM. 

Turning attention to quadratic functionals of hazards based on non-homogeneous 
CRM the importance of our Theorem [J] becomes apparent: the verification of the 
conditions of Theorem [5] and E] become extremely difficult if not impossible. Hence, 
when it is possible to bound above and below the Poisson intensity of a non- 
homogeneous CRM so to meet the conditions of Theorem HI we are still able to 
state that the rate function is Ci{k, T) = T^/"^ for hazards based on rectangular and 
Ornstein-Uhlenbeck kernels. Moreover, we can deduce the convergence, along some 
subsequence T„/ of every diverging sequence T„, of the path-second moment and 
of the path-variance to a Gaussian random variable with variance taking value in 
the range [a\{k) + a1{k),a\{k) + ct|(A;)] and [erf (A;) + (t|(A;), erf (A;) + cr|(A;)], respec- 
tively. In order to deduce convergence for every diverging sequence, the structure 
of the Poisson intensity has to be specified as well. Thus, let us consider again the 
extended gamma and beta CRMs. As noted in Section 4.1, supposing = /3 
in ([7]) and c{x) = c in (l8|), the CRMs become homogeneous and the previous re- 
sults hold with the same rate functions. Note that, for a > 0, K^'^ = r(a) 
in the extended gamma case and i^^"^ = r(a) [(1 + c)a~i] ^ in the beta case. 
Hence, with an Ornstein-Uhlenbeck kernel the asymptotic variance of the path- 
second moment is equal to {Qk? + 9k + 4) for the former and equal to 
[k^{1 + c){l + 0)3]-^ {<dKC^ + 37kc + 30k + 6^2(1 + c) + 30k(1 + c) + 4(1 + 0)3) 
for the latter. For the path-variance similar expressions are obtained. If (3 (or c) 
are functions bounded by some finite constant M, then we are in the genuinely 
non-homogeneous case and, as mentioned above, by Theorem 4 CLTs along sub- 
sequences of diverging sequences are granted. To achieve convergence along any 
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sequence, it is enough to suppose that /3 (or c) are eventuahy non-decreasing (or 
non-increasing), which represents a sensible choice in any appUcation. For in- 
stance, considering an extended gamma CRM with non-decreasing /? taking values 
in [L, M] combined with an Ornstein-Uhlenbeck kernel the path-second moment 
will converge, along any sequence, to a Gaussian random variable with variance 
al{k) + (/c) = (M"' {Gk^ + 9k + 4). Analogous considerations hold for the 

path-variance. 

(in) Dykstra-Laud and U-shaped kernels. Our results for quadratic functionals 
do not apply when choosing the kernel k to be the Dykstra-Laud or U-shaped 
kernel. Indeed, for both kernels Conditions 3., 5. and 6. in Theorem [2] are not 
met. Moreover, also the additional conditions 1.-3. in Theorem [3] are not satisfied. 
Note that Condition 3. represents the most delicate since it involves a contraction. 
Consider first the Dykstra-Laud kernel. It is easy to see that k!j:\s,x;t,y) = ^{T- 
x)\o<y<x<T) and that k^'' -k\ k^^ (s, x; t, y) = ^^^''r^J' 
As for Condition 1. we obtain with the choice Ci = T^^ 

2 2 



{T-yy _ {T-xY 



L2(l/2) 6 



2^2 

This, however implies that the quantity in Condition 3. converges to a positive con- 
stant and the ones in Condition 5 and 6. diverge. In Theorem [3] we obtain that the 
quantity in Condition 1. is equal to 1 and the one in Condition 2. diverges. Finally, 
Condition 3. cannot be satisfied since Condition 5. in Theorem [2] is violated. For 
the U-shaped kernel we obtain again Ci{k, T) = and the asymptotic behaviour 
of the various quantities involved in the conditions is the same as the one of the 
Dykstra and Laud kernel. We have also tried with non-homegeneous CRM: indeed, 
it seems possible to obtain Ci{k,T) = T^"^ with any rj G (0,1], but the conditions 
are nonetheless violated. 

The fact that our results do not work for the Dykstra-Laud and U-shaped kernels 
seem to suggest that kernels yielding monotone increasing hazards (at least from 
some point onwards as it is the case for the U-shaped kernel) exhibit a too strong 
growth to be compatible with our conditions. Future research will focus, on one 
side, on the translation of the conditions into simple and intuitive sufficient ones 
regarding the behaviour of the hazard rate induced by different classes of kernels 
and, on the other side, to relax the conditions in order to cover models for monotone 
increasing hazards. 
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5 Proofs and further techniques 



In this section we collect the proofs of the main results of the paper. As antic- 
ipated, we shall make a substantial use of the CLTs, for sequences of single and 
double Poisson integrals, recently established by Peccati and Taqqu (2006b). In the 
next subsection we present some preliminary results concerning double Wiener-Ito 
integrals, with special attention devoted to weak convergence and central limit the- 
orems. Virtually all of the needed background material, about stochastic integrals 
of any order with respect to Poisson measures, can be found in Surgailis (1984) and 
in Chapter 10 of Kwapieh and Woyczyhski (1992). A different approach, based on 
Hilbert space techniques, is described in Nualart and Vives (1990). The reader is 
also referred to Surgailis (2000) for an updated review of related convergence results. 

5.1 Double integrals and CLTs 

Throughout this section we consider a Poisson CRM N such that (jllip is verified. 
Recall that A^'^ is the compensated Poisson measure defined in formulae (jl2p and 
([13]). For every / € g [u"^) , we denote by I^" (/) the double Wiener-Ito integral of 
/ with respect to A^"^. The reader is referred to Surgailis (1984) for precise definitions. 
Here, we shall recall that, if / G L^q (z^^) is a piecewise constant function with 
support contained in a product set S x S C (R^ x X)^ such that u (S) < +oo, then 
the variable (/) is a genuine ("pathwise") double integral with respect to the 
restriction to 5" x 5 of the (signed) product measure A^*^ (ds, dx) A'^ (dt, dy). The very 
nature of / implies that the integration is performed on the intersection between 
S X S and the non-diagonal set Dq. For a general f ^ L^^ (i^^) , (/) is simply the 
limit in (P) of random variables of the kind J^" (/fc) where each /fc G q (i/^) is 
a piecewise constant function with support in a product set 5^ x Sj. with z^^-finite 
measure. The following isometric relation is well-known: V/1,/2 ^ (j^^) 



E [ir (/i) X ir (/2) 

= 2/ / fi{s,x;t,y) f2is,x;t,y)i^{ds,dx)i^{dt,dy) . (43) 

Jn+xX JR+xX 

When f £ L'^ (z^-^) (hence / does not necessarily vanish on diagonals), we set 
-^2^'^ (/) = ^2'^ (-^^^o)' observe that the isometry property ([^31) still holds. 

Indeed, v is non-atomic, and therefore 1^^ does not charge diagonals (even though 
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A^'^ (ds, dx) A^'^ (dt, dy) does). We also recall the product formula 

= {9, h) L2(^) + I g{s,x)h {s, x) {ds, dx) + if (h^g) , (44) 

where h®g (s, x; t,y) = h {s, x) g (t, y) G (z^^) and (~) stands for a symmetrization, 
which holds for every f,g^L^ (v) such that g {s, x) h (s, x) € {u). 

Finally, we state the main results proved in Peccati and Taqqu (2006b). We 
consider a sequence of double integrals 

Fn = ltifn), n>l, (45) 

where fn G L^ q (z/^) . We will suppose that the following technical assumptions are 
satisfied: the sequence /„, n > 1, in ([35]) is such that, for every n > 1, 

||/n|lL2(.2) > and /„ 4 fn G {i^) , (Nl) 
/ fn{s,y;-)^u{ds,dy)y €L'{u), (N2) 

/R+xX J 

where we use the notation introduced in ([9|)- ()lip . and moreover, as n — > +00, 



fn{s,y;t,x) u {ds,dy)u {dt,dx) ^ 0. (N3) 

R+xX JR+xX 

Note that ()N3P implies, in particular, that /„ € (i^^) for every n. See Peccati and 
Taqqu (2006b) for a discussion of the role of (|Nip - (jN3p . In the subsequent sections, 
we will see how such assumptions restrict the set of the random hazard rates that 
can be studied by our techniques. The next result is a CLT involving sequences of 
double integrals. 

Theorem 5 (Peccati and Taqqu, 2006b, Th.7) Define the sequence Fn = {fn) 
and fn G L^q{v'^), n>l, as in ([^5]) . and suppose ()Nip - ()N3p hold. Then, fn'^^ifn £ 
LP'{v^) for every n > 1, and moreover: 



1. if 



ll/nllZ^V) X (/n*l/n) ^ QinL^{v^) and (46) 
||/n|li2%2) X (/„ -kl fn) ^ in L^{v) 
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then 

2-^/2 ||/„||Z.V)X^"^^' (47) 
where X ~ ,yV (0, 1) is a standard Gaussian random variable; 

2. if Fn G (P) for every n, then a sufficient condition to have (jl6|) is that 

(2||/n|li2(,2))"'E(F„4) ^3; (48) 

3. if the sequence ^2 ||/„||^2(^2)^ -F^ : n > l|> is uniformly integrahle, then 
conditions (j46l) . (I47|l and (j48]l are equivalent. 

Theorem [5] is proved by using a decoupling technique, known as the principle 
of conditioning, which has been adapted to the framework of CRM by means of 
the general theory of stable convergence developed in Peccati and Taqqu (2006a). 
The next result gives sufficient conditions to have that the law of a random vector, 
composed of a single and of a double integral, converges weakly to a bivariate Gaus- 
sian law. The proof is essentially based on an appropriate version of the product 
formulae for multiple stochastic integrals, proved e.g. in Surgailis (1984). 

Theorem 6 (Peccati and Taqqu, 2006b, Th. 8) 

(A) Consider a sequence 

Gn = i9n) , n > 1, 
where gn € L'^{i^) H L^iiy) and \\gn\\L^(^,y) > 0, and suppose that, as n ^ +oo, 

{s,y)\^i^ids,dy)^0. (49) 

Then, ||5'n||^2(jy) x Gn ^ X, where X ~ ^ (0, 1) is a centered standard Gaussian 
random variable. 

(B) Consider a sequence Fn = {fn), n>l, with fn G L1q{v'^) as in ([^5]) . 
and a sequence Gn = (gn), n > 1, as at Point (A). Suppose moreover that 

(i) The sequence (fn) verifies assumptions ()Nip - ()N3p . and satisfies condition (|46|) ; 

(ii) The sequence (gn) satisfies (|i9]) . 
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Then, as n ^ +00, 

(2-^/' ||/n|IZ>2) X F„, llffnllZ^V) X G„) ^ {X,X') , (50) 

where X,X' ~ o/K(0, 1) are fwo independent, centered standard Gaussian random 
variables. 

Part B of Theorem [6] implies in particular that, whenever conditions (|46p and 
(j50p are met, the (componentwise) convergence of ||/n||~"^ x F„ and H^nll ""^ x G„, 
towards a Gaussian distribution, implies necessarily the joint convergence of the 
vector ^||/n||~"^ -^n, lls'nll"^ . This conclusion echoes results already established 
in the framework of Gaussian CRM (see Peccati and Tudor (2005)). 

Now consider the positive kernel k, which defines h via ([T]), and suppose (here 
and for the remainder of the Section) that k satisfies assumption ()H2p . In the next 
two Lemmas we collect some straightforward facts which will be used throughout 
the sequel. 

Lemma 1 The two processes h{t), t >0, and 

h^{t) := N'' {{■)k{t,-)) + sk{t,x)iy{ds,dx) , t > 0, 

JR+xX 

where 



Nm-)k{t,-)) := sk{t,x)N^ds,dx) , (51) 

Jr+xX 

have the same law. 

Proof. Use (0) and ([HD to compute the two transforms 

for every n > 1, every (Ai, A^) G IR" and every ti, tn > 0. □ 

Lemma 2 For every T > 0, 
rT 



1 



JR+x 

7 

JR+xX 



sk{t,x)N^ds,dx)dt = iV^(^4°^), (52) 
s^k{t,xf N^ds,dx)dt = N^lk^^'^y (53) 
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where k^^ and k!^^ are given, respectively, by <^Si) and Ifk!^' G (u) nL^ (u) 



.(2) 



(3) 



1 

T 



T 



N%{-)k{t,-)) 







R+xX 



sk (t, x) V (ds, dx) ](lt = N"" [k 



S3) 



Analogously, for every T > 0, 
rT 



f I it ([(•) k {t, •)] [(•) k {t, •)]) dt = it [k?) , 



(54) 



(55) 



where [(•) k (t, •)] [(•) k {t, •)] {u, x; v, y) := uvk {t, x) k (t, y), and k^^ is defined ac- 
cording to 



The proof of Lemma [2] is trivial when the map (t, x) ^ k {t, x) is piecewise 
constant: indeed, in this case ([5^ . and follow immediately from the 

application of a standard Fubini theorem. The general statement is obtained by a 
density argument; we omit the details here (one can e.g. mimic the proof of Lemma 
13 in Peccati, 2001). 

Finally note that, given two sequences of random variables {An} and {Bn} such 
that An — Bn ^ in probability, we will sometimes write 

A ^B 

j^n ~ J->n- 



5.2 Proof of Theorem [T] 

Use Lemma [Hand relations ()5ip and (|52p to write 

rT 



H{T) 



law 



K (t) dt 



N''{{-)k{t,-))dt+ [ [ sk{t,x)i^{ds,dx)dt 

Jo Ju+xX 

[ sk{t,x) {ds,dx)dt+ [ [ sk {t,x) u {ds,dx) dt 

JR+xX Jo Jr+xX 
^ f 

/ sA; (t, x) (ds, dx) dt, 
Jr+xX 



( 4°^ ) + 



which yields, via the relation E(^(r)) = /j^+^x ^k (t, x) (ds, dx) dt, 

law ^rr ( ^ /, m\ 1,(0)^ 



Co (A:,T) X \^[T) - E(F(r))J = iV^ (^Cq {k,T) x 4 
Since the isometry property and the assumption ([19]) yield 
E[iV^(C7o(A;,r)xfcf )2] = C72(A:,r) 



/ 




/R+xX 
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we deduce from Part A of Theorem [6] (in the case g„ = (Co {k,Tn) /cfq (k)) x k^^, 
where T„ is any positive sequence diverging to infinity) that, since (j20p holds, the 
CLT (12 ip must also take place. □ 



5.3 Proof of Theorem [2] 

Use Lemma [T] to write (we adopt once again the notation (j5ip ) 

U hitfdt'^ U Nm.)kit,.)fdt + U ([ skit,x)u{ds,dx))dt 
^ Jo Jo ^ Jo \Jm.+xX J 



2 



T 

Now recall that, thanks to ^ 
rT 



+ Nm-)k{t,-))( [ sk{t,x)u{ds,dx)]dt. 

\Jr+xX 



^ [ {{■)k {t, •)) ( [ sk {t, x) V (ds, dx)\ dt = (243)) , 

^ Jo \Ju+xX J ^ ' 



SO that, by using ([26|). 

Ci (fc, T) X 1 1 hitfdt nHt?]dt 



law „ / , I 1 



T 



C,{k,T)x{j; I N'{{.)k{t,.)fdt 







+ N^(2kP)-^l I s'^k{t,xf u{ds,dx)\dt{bQ) 

By applying the product formula ([M]) in the case g{s,x) = h{s,x) = sk{t,x), for 
every t > we obtain 



N^{{.)k{t,-)y = / s^kit,xy iyids,dx) 

Jr+xX 



+ / s'k{t,xrN^ds,dx) + li' {[{■)k{t,.)]0[{-)k{t,.)]) 

Jn+xx 



from which we deduce that, thanks to formulae (j53|) and (|55|) . the expression in ([56 
is indeed equal to 

Ci {k, T)x[n' (k^? + 24')) + it (4^) } , 
for every T > 0. It follows that Theorem [2] is proved, once it is shown that 
[n' (Ci {k,T) X (4^ + 24))) Jt (Ci ik,T) X 4^)) ^ (^,^0 
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where X and X' are independent and such that X ~ ^ [0,a2 (k)^ and X' 
jV (O, cr^ (^))- To this end, we apply Part B of Theorem [6) according to such a 
result, it is sufficient to check that, for every positive sequence — > +C)0, the two 
sequences 



9n 



and /„ 



satisfy, respectively, condition ()49p and conditions (|Nip - ()N3p and ()46p . It is im- 
mediately seen that Assumptions 5 and 6 in the statement imply ()49p . and we are 
therefore left with the sequence {fn}- Conditions (|Nip and ()N2p can be checked by 
standard iterations of the Jensen and Cauchy-Schwarz inequalities (see e.g. Section 
5.1 in Peccati and Taqqu (2006b) for several analogous computations). Finally, (jNSj) 
is given by Assumption 2 in the statement, whereas Assumptions 3 and 4 give, re- 
spectively, the first and the second line in (j46p . This concludes the proof of Theorem 

m □ 



5.4 Proof of Theorem [3] 

Write first 



1 



1 



- / m - -H{T)Ydt = - / h{tYdt - -H{T) , 







T 



T 



1 



(57) 



and observe that 

Ci(/c,r) (^H{T) 



Ci{k,T) 
T^Co{k,TY 
Ci{k,T) 



+ 



+ 2 



2^2 
Ci ik,T) 

J'2 



Co{k,T) [H{T)-nH{T))\y 
E{H{T)Y (58) 

E(^(r)) \h{t) - E(^(r)) . 



Prom Assumption 1 in the statement, and since (fT9P and (f20l) are in order, we deduce 
Ciik,T) c - 1.2 



{Co {k, T) [h{T) - E{H{T))\ } ^ 0. 



T^Co{k,Tf 

Moreover, Assumption 2 in the statement yields that, as T ^ +oo, 
2Ci{k,T). 



(59) 



J'2 



E(i?(r)) H{T) -E(i?(T)) 



5ik)Coik,T) HiT)-EiH{T)) (60) 
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In view of Lemma [H and by reasoning as in the proof of Theorem [T] and Theorem 
[21 we infer from relations ([57|) - ([60]) that 



Ci{k,T) X 



1 



[h{t) - -H{T)]^dt 



1 



^[h{tf]At + 



m{T))f 



law. 



AT^ Ci{k,T) yfcf^ +24' 



E{H{T))k!j!A+lt (Ci {k,T) 4 



2^2 



iV^ (^Ci (/t, T) (^4^ + j - (5 (A;) Co (A;, T) 4^ j + If (^Ci (A;, T) 4 

The conclusion is deduced from Assumption 3 in the statement, by applying Theo- 
rem [6] in the case 

Ci {k, Tn) (k'^Tl + 24?) - 5 {k) Co {k, Tn) 



9n 
fn 

where — > +00. 



Cl {k,Tn) ,Jl) 

I , X t^T 1 IT- ^ i-t 

ai{k) ^" 



□ 



5.5 Proof of Theorem [4] 

To prove Part (A) , observe that the assumptions imply the existence of two constants 
< Di < D2 < +00, such that, for T sufficiently large, 



Di<C^{k,T) X 



k!^^ {s,x) u {ds,dx) < D2. 



Standard arguments yield therefore that, for every sequence T„ — > +00, there exists 
a subsequence T„/ such that, as n' — > +00, 



C^{k,Tn') X 



R+xX 



k!^\{s,x) i/(ds,dx) ^ o-^ (A;) > 0, 



where o"^ (Ac) is some well chosen positive constant. Moreover, 



C'^{k,Tn')x I \kP^{s,x)\ z^(ds,dx) 



R+x 
3 



— Cq {k, Tn 



R-t 



=(0) 
=(0) 

kr, (s,x) 



u (ds, dx) 



u (ds, dx) 0. 



The proofs of Parts (B) and (C) are based on analogous computations, and are 
omitted. □ 
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6 Conclusions and future work 



(I) Future research will focus on the generahzation of our asymptotic results to gen- 
eral multiplicative intensity models (Aalen, 1978), which include a wide variety of 
popular models such as Cox proportional hazards regression models, multiple decre- 
ment models, birth and death processes and non-homogeneous Poisson processes. 
To fix ideas consider the Cox proportional hazards regression model, in which Zj 
is an m-dimensional vector of covariates recorded for the i-th individual and 9 is 
a m-dimensional vector of unknown regression coefficients. Then the proportional 
hazards model is specified in terms of the hazard function relationship as 



where ho represents the so-called baseline hazard function. A Bayesian treatment 
leads to considering Hq and 9 to be random and, hence, by choosing Hq to be a 
mixture as in ([1]) and tt to be a prior for 9, one obtains a semi-parametric random 
hazard rate function for the i-th individual of the form 



Bayesian analysis of the Cox model within this setup has been pursued in Ibrahim, 
Chen and Mac Eachern (1999), James (2003), Ishwaran and James (2004), Nieto- 
Barajas and Walker (2005). Since ([1]) still represents the basic building block of 
()6ip and, indeed, also of other multiplicative intensity models, we aim at extending 
our results to random objects such as (j6ip and expect to obtain CLTs for which the 
limiting random variable is a suitable mixture of Gaussian distributions. 

(II) The techniques exploited in Section \5\ for deriving the main results of this 
paper, can be further generalized. As already mentioned, they are indeed based 
on a very general decoupling criterion, known as the principle of conditioning. As 
shown in Peccati and Taqqu (2006a, b), this principle can be applied to a wide 
class of stochastic integrals with respect to completely random measures, including 
multiple Wiener-Ito integrals of any order n > 2. In particular, we expect that the 
results of the present paper can be suitably extended to accommodate the asymptotic 
analysis of non-linear and non-quadratic functionals, such as e.g. path-moments of 
order greater than two. Note that results of this type are already available in the 
Gaussian case. See, e.g., Peccati and Tudor (2005). 



hi{t) = ho{t)exp{9' Zi) 




(61) 
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